Increase running pod memory limit for rapid_appends to prevent cgroup OOM by yaozile123 · Pull Request #1282 · GoogleCloudPlatform/gcs-fuse-csi-driver

yaozile123 · 2026-03-27T22:59:27Z

What type of PR is this?

Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line:

/kind failing-test

What this PR does / why we need it:
The rapid_appends GCSFuse integration tests fail with exit code 137 in file cache environments due to reaching the container's memory limit (Killed by cgroup OOM). This happens because the file cache suites default the volume-tester pod to a 1Gi limit, while compiling and executing the rapid_appends package can peak at around 1Gi of memory usage.

This PR updates configureLargeFileResources to ensure that rapid_appends tests bump the volume-tester pod to a standard 3Gi memory limit, preventing hard limit breaches.

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:
Tested on managaed driver with ZB enabled

make e2e-test E2E_TEST_USE_GKE_MANAGED_DRIVER=true ENABLE_ZB=true E2E_TEST_FOCUS=rapid_appends

Ran 8 of 430 Specs in 1421.077 seconds
SUCCESS! -- 8 Passed | 0 Failed | 0 Pending | 422 Skipped


Ginkgo ran 1 suite in 23m50.099545872s
Test Suite Passed

… OOM

google-oss-prow · 2026-03-27T22:59:30Z

@yaozile123: The label(s) kind/failing-test cannot be applied, because the repository doesn't have them.

Details

In response to this:

What type of PR is this?

Uncomment only one /kind <> line, hit enter to put that in a new line, and remove leading whitespaces from that line:

/kind failing-test

What this PR does / why we need it:
The rapid_appends GCSFuse integration tests fail with exit code 137 in file cache environments due to reaching the container's memory limit (Killed by cgroup OOM). This happens because the file cache suites default the volume-tester pod to a 1Gi limit, while compiling and executing the rapid_appends package can peak at around ~934MB of RSS.

This PR updates configureLargeFileResources to ensure that rapid_appends tests bump the volume-tester pod to a standard 3Gi memory limit, preventing hard limit breaches.

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:
Tested on managaed driver with ZB enabled
make e2e-test E2E_TEST_USE_GKE_MANAGED_DRIVER=true ENABLE_ZB=true E2E_TEST_FOCUS=rapid_appends

Ran 8 of 430 Specs in 1421.077 seconds
SUCCESS! -- 8 Passed | 0 Failed | 0 Pending | 422 Skipped


Ginkgo ran 1 suite in 23m50.099545872s
Test Suite Passed

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

google-oss-prow · 2026-03-27T22:59:30Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

gemini-code-assist

Code Review

This pull request modifies the configureLargeFileResources function in the GCS Fuse integration test suite to set resource requirements for the test pod when performing rapid append tests. Feedback was provided regarding a mismatch in the memory limit, suggesting it be adjusted to 3Gi for consistency with the sidecar container's configuration.

test/e2e/testsuites/gcsfuse_integration.go

google-oss-prow · 2026-03-30T17:12:03Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: amacaskill, yaozile123

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

increase running pod memory limit for rapid_appends to prevent cgroup…

f6e25a3

… OOM

gemini-code-assist bot reviewed Mar 27, 2026

View reviewed changes

test/e2e/testsuites/gcsfuse_integration.go Show resolved Hide resolved

yaozile123 marked this pull request as ready for review March 28, 2026 00:06

yaozile123 requested review from Sneha-at and amacaskill March 28, 2026 00:06

amacaskill approved these changes Mar 30, 2026

View reviewed changes

google-oss-prow bot assigned amacaskill Mar 30, 2026

google-oss-prow bot added the lgtm label Mar 30, 2026

yaozile123 merged commit 668ba34 into GoogleCloudPlatform:main Mar 30, 2026
8 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increase running pod memory limit for rapid_appends to prevent cgroup OOM#1282

Increase running pod memory limit for rapid_appends to prevent cgroup OOM#1282
yaozile123 merged 1 commit intoGoogleCloudPlatform:mainfrom
yaozile123:fix-rapid-appends-oom

yaozile123 commented Mar 27, 2026 •

edited

Loading

Uh oh!

google-oss-prow bot commented Mar 27, 2026

Uh oh!

google-oss-prow bot commented Mar 27, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

google-oss-prow bot commented Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yaozile123 commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-oss-prow bot commented Mar 27, 2026

Uh oh!

google-oss-prow bot commented Mar 27, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

google-oss-prow bot commented Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yaozile123 commented Mar 27, 2026 •

edited

Loading